CODE SENSOR ATTACHMENT FOR PEN 

FIELD OF INVENTION 

The present invention relates generally to a sensing device for capturing 

interactions with and movement relative to a surface, and more particularly, to a sensing 
device attachable to a stylus or marking implement such as a pen, pencil or eraser. 
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BACKGROUND 

Self-contained pen-like devices have been described which mark a sxirface 
while simultaneously capturing their own movement relative to the surface. Some such 
devices sense their own movement using embedded accelerometers. Others sense their 
5 own movement by detecting information which is machine-readably encoded on the 
surface. In this latter category, see for example US Patent 5,477,012, US Patent 
5,652,412 and PCT application WO99/50787. 

Since these pen-like devices typically include an integral marking means, they 
support only a limited set of marking modes or styles. 

1 0 SUMMARY OF INVENTION 

In accordance with a first aspect of the present invention, there is provided a 

sensing device for use with a surface having coded data disposed on the surface, the 
coded data being indicative of an identity of a region associated with the surface, said 
device including detection means arranged to detect the coded data and to generate 
1 5 region identity data indicative of the identity of the region using the coded data, and 
attachment means for facilitating attachment of the device to a writing implement. 

Preferably, the attachment means is adapted to facilitate attachment and 
detachment of the device to a writing implement. 

Preferably, the attachment means is a clamp. 

20 Preferably also, the device includes means for ensuring correct orientation of 

the device when the writing implement is held by a user during use. Said means for 
ensuring correct orientation of the device may be a grip portion configured so as to 
correspond with a portion of a user's hand. 

Preferably, the sensing device includes calibration means for calibrating the 
25 device such that information indicative of the distance between a writing portion of the 
writing implement and the detection means is incorporated into said region identity data. 

Preferably, the attachment means is adapted to facilitate attachment of the 
device to a pen, marker or pencil. 

Preferably, the sensing device includes motion sensing means configured to 
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generate movement data indicative of movement of the sensing device relative to the 
region. 

Preferably, the motion sensing means is configured to generate the movement 
data using the coded data. 

5 In an altemative embodiment, the motion sensing means includes at least one 

acceleration sensing means, the acceleration sensing means being configured to sense 
acceleration of the sensing device as the sensing device moves relative to the region, the 
motion sensing means being configured to generate the movement data by periodically 
sampling the acceleration. 

1 0 Preferably, the acceleration sensing means is configured to sense at least two 

substantially orthogonal components of acceleration. 

Preferably, the coded data includes a plurality of tags, each tag being indicative 
of an identity of a region within which the tag Ues. 

In accordance with a second aspect of the present invention, there is provided a 
15 system for capturing information applied freehand, said system including a sensing 
device as described above, and a surface having coded data associated with the surface. 

Features and advantages of the present invention will become apparent fi-om 
the following description of embodiments thereof, by way of example only, with 
reference to the accompanying drawings. 

20 
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BRIEF DESCRIPTION OF DRAWINGS 

Preferred and other embodiments of the invention will now be described, by 

way of non-limiting example only, with reference to the accompanying drawings, in 
which: 

5 Figure 1 is a schematic of a the relationship between a sample printed netpage and its 
online page description; 

Figure 2 is a schematic view of a interaction between a pen with an attached netpage 
code sensor attachment, a netpage printer, a netpage page server, and a netpage 
application server; 

1 0 Figure 3 is a schematic view of a high-level structure of a printed netpage and its online 
page description; 

Figure 4a is a plan view showing a structure of a netpage tag; 

Figure 4b is a plan view showing a relationship between a set of the tags shovra in Figure 
4a and a field of view of a netpage sensing device; 

1 5 Figure 5a is a plan view showing an altemative structure of a netpage tag; 

Figure 5b is a plan view showing a relationship between a set of the tags shown in Figure 
5a and a field of view of a netpage sensing device; 

Figure 5c is a plan view showing an arrangement of nine of the tags shown in Figure 5a 
where targets are shared between adjacent tags; 

20 Figure 5d is a plan view showing the interleaving and rotation of the symbols of the four 
codewords of the tag shown in Figure 5a; 

Figure 6 is a schematic view of a tag image processing and decoding algorithm; 

Figure 7 is a perspective view of a netpage code, sensor attachment according to the 
present invention, attached to a pen; 

25 Figure 8 is a perspective view of the netpage code sensor attachment attached to a 
capped pen; 

Figure 9 is a perspective view of the netpage code sensor attachment; 

Figure 10 is a perspective view of the underside of the netpage code sensor attachment; 
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Figure 1 1 is an exploded view of the netpage code sensor attachment; 
Figure 12 is a cross section of the netpage code sensor attachment; 

Figure 13 is a perspective view of the PCB and optics block of the netpage code sensor 
attachment; 

5 Figuire 14 is a schematic block diagram of a controller for the netpage code sensor 
attachment; 

Figure 15 is a schematic view of a netpage code sensor attachment optical path; 

Figure 16 is a flowchart of a stroke capture algorithm; 

Figure 17 is a schematic view of a raw digital ink class diagram; 

10 Figure 18 is a perspective view of the netpage code sensor attachment housed in a base 
according to the present invention; 

Figure 19 is a plan view of the netpage code sensor attachment housed in the base; 

Figure 20 is a side elevation of the netpage code sensor attachment housed in the base; 

Figure 21 is a front elevation of the netpage code sensor attachment housed in the base; 

1 5 Figure 22 is a cross section of the netpage code sensor attachment and base, with the 
attachable sensing device housed in the base; 

Figure 23 is a schematic view of the calibration of point nib geometry; and 
Figure 24 is a schematic view of the calibration of spherical nib geometry. 
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DETAILED DESCRIPTION OF PREFERRED AND OTHER EMBODIMENTS 

Note: MemjeF^ is a trademark of Silverbrook Research Pty Ltd, Australia. 



Tnfth^ prflffrrH ^n-»1->r^Himrti| lll^ iny^r>Tion ^e i^nn^^^^rf>A tn wr^rV with thc 

netpage networked computer system, a summary of which is given below and ajl©t^iled 
description of which is given in our co-pending applications, inclu^iftgin particular 

applications USSN 09/ (docket no. NPS027U3>?<JSSN 09/ 

(docket no. NPP023US), USSN 09/ C^l0dIcet no. NPT002US) and USSN 

09/ (docket no. USIUS}^^--!! will be appreciated that not every 

implementation will necessarjjy^'^body all or even most of the specific details and 
1 0 extensions described^mr'tnese applications in relation to the basic system. However, the 
system is dpsc5nbed in its most complete form to assist in understanding the context in 



\Arhj^nhf-. prefi ' 1 1 r d i^i in hn rliinrnT fsand aspects of the present invention operate; — 

In brief summary, the preferred form of the netpage system employs a computer 
interface in the form of a mapped surface, that is, a physical surface which contains 

1 5 references to a map of the surface maintained in a computer system. The map references 
can be queried by an appropriate sensing device. Depending upon the specific 
implementation, the map references may be encoded visibly or invisibly, and defined in 
such a way that a local query on the mapped surface yields an unambiguous map 
reference both within the map and among different maps. The computer system can 

20 contain information about features on the mapped surface, and such information can be 
retrieved based on map references supplied by a sensing device used with the mapped 
surface. The information thus retrieved can take the form of actions which are initiated 
by the computer system on behalf of the operator in response to the operator's interaction 
with the surface features. 

25 Li its preferred form, the netpage system relies on the production of, and human 

interaction with, netpages. These are pages of text, graphics and images printed on 
ordinary paper or other media, but which work like interactive web pages. Information is 
encoded on each page using ink which is substantially invisible to the unaided human 
eye. The ink, however, and thereby the coded data, can be sensed by an optically imaging 

30 pen and transmitted to the netpage system. 

In the preferred form, active buttons and hyperlinks on each page can be 
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clicked with the pen to request information from the network or to signal preferences to a 
network server. In one embodiment, text written by hand on a netpage is automatically 
recognized and converted to computer text in the netpage system, allowing forms to be 
filled in. In other embodiments, signatures recorded on a netpage are automatically 
5 verified, allowing e-commerce transactions to be securely authorized. 

As illustrated in Figure 1, a printed netpage 1 can represent a interactive form 
which can be filled in by the user both physically, on the printed page, and 
"electronically, via commimication between the pen and the netpage system. The 
example shows a "Request" form containing name and address fields and a submit 

1 0 button. The netpage consists of graphic data 2 printed using visible ink, and coded data 3 
printed as a collection of tags 4 using invisible ink. The corresponding page description 
5, stored on the netpage network, describes the individual elements of the netpage. In 
particular it describes the type and spatial extent (zone) of each interactive element (i.e. 
text field or button in the example), to allow the netpage system to correctly interpret 

1 5 input via the netpage. The submit button 6, for example, has a zone 7 which corresponds 
to the spatial extent of the corresponding graphic 8. 

illustrated m t^igure^Zrtite-iie^age^ ien lOL a preferred fo rm of which is^ 

described"in our co-pending application USSN 09/ (docket no. NPSO^ltfS), 

works in conjunction with a netpage printer 601, an Intemet-connept^ printing 
20 appliance for home, office or mobile use. The pen is wireless and corjHfiunicates securely 
with the netpage printer via a short-range radio link 9. 

The netpage printer 601, preferred forms gf^hich are described in our co- 
pending appUcations USSN 09/ ^<^ocket no. NPP023US) and USSN 

09/ (docket no. NPS024US)^..is able to deliver, periodically or on demand, 

25 personalized newspapers, magaziij^^; catalogs, brochures and other publications, all 
printed at high quality as intep^ctive netpages. Unlike a personal computer, the netpage 
printer is an appliancc/^ich can be, for example, wall-mounted adjacent to an area 
where the momiivg^ews is first consumed, such as in a user's kitchen, near a breakfast 
table, or npaf^ the household's point of departure for the day. It also comes in tabletop, 
30 desktg^ port able and miniature versions. 




Netpages printed at their point of consumption combine the ease-of-use of 
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paper with the timeliness and interactivity of an interactive medium. 

As shown in Figure 2, the netpage pen 101 interacts with the coded data on a 
printed netpage 1 and communicates, via a short-range radio link 9, the interaction to a 
netpage printer. The printer 601 sends the interaction to the relevant netpage page server 
10 for interpretation. Li appropriate circumstances, the page server sends a corresponding 
message to application computer software running on a netpage application server 13. 
The application server may in turn send a response which is printed on the originating 
printer. 



Jg^-^ietpage iiystem is tnaae consideiably more convenient in the prefeire^ 
embodiment by being used in conjxmction with high-speed microelectcdfilSchanical 
system (MEMS) based Inkjet (Memjet™) printers, for ex^iple-'^^escribed in our co- 
pending application USSN 09/ (do£ket'tIoriJ52US). In the preferred form of 

this technology, relatively high-sp5^d--^id high-quality printing is made more affordable 
to consumers. In its prefeH:€aform, a netpage publication has the physical characteristics 
of a traditiojiaHiewsmagazine, such as a set of letter-size glossy pages printed in full 

CQlef^n both side f >, hmmd tngr^thor fui i As.y i rA\ji^'A \ \ii n and rnmfortahlp Kari7!1tng^ 

The netpage printer exploits the growing availability of broadband Intemet 
access. The netpage printer can also operate with slower connections, but with longer 
delivery times and lower image quality. The netpage system can also be enabled using 
existing consumer Inkjet and laser printers, although the system will operate more slowly 
and will therefore be less acceptable from a consumer's point of view. In other 
embodiments, the netpage system is hosted on a private intranet. In still other 
embodiments, the netpage system is hosted on a single computer or computer-enabled 
device, such as a printer. 

Netpage publication servers 14 on the netpage network are configured to 
deliver print-quality publications to netpage printers. Periodical publications are 
delivered automatically to subscribing netpage printers via pointcasting and multicasting 
Intemet protocols. Personalized publications are filtered and formatted according to 
individual user profiles. 

A netpage printer can be configured to support any number of pens, and a pen 
can work with any number of netpage printers. In the preferred implementation, each 
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netpage pen has a unique identifier. A household may have a collection of colored 
netpage pens, one assigned to each member of the family. This allows each user to 
maintain a distinct profile with respect to a netpage publication server or application 
server. 

5 A netpage pen can also be registered with a netpage registration server 1 1 and 

linked to one or more payment card accounts. This allows e-commerce payments to be 
securely authorized using the netpage pen. The netpage registration server compares the 
signature captured by the netpage pen with a previously registered signature, allowing it 
to authenticate the user's identity to an e-commerce server. Other biometrics can also be 
10 used to verify identity. A version of the netpage pen includes fingerprint scanning, 
verified in a similar way by the netpage registration server. 

Although a netpage printer may deliver periodicals such as the morning 
newspaper without user intervention, it can be configured never to deliver xmsolicited 
junk mail. In its preferred form, it only delivers periodicals fi*om subscribed or otherwise 
1 5 authorized sources. In this respect, the netpage printer is unlike a fax machine or e-mail 
accoimt which is visible to any jimk mailer who knows the telephone number or e-mail 
address. 

Each object model in the system is described using a Unified Modeling 
Language (UML) class diagram. A class diagram consists of a set of object classes 

20 connected by relationships, and two kinds of relationships are of interest here: 
associations and generalizations. An association represents some kind of relationship 
between objects, i.e. between instances of classes. A generalization relates actual classes, 
and can be imderstood in the following way: if a class is thought of as the set of all 
objects of that class, and class A is a generalization of class B, then B is simply a subset 

25 of A. Each class is drawn as a rectangle labelled with the name of the class. It contains a 
list of the attributes of the class, separated fi-om the name by a horizontal line, and a list 
of the operations of the class, separated firom the attribute list by a horizontal line. In the 
class diagrams which follow, however, operations are never modelled. An association is 
drawn as a line joining two classes, optionally labelled at either end with the multiplicity 

30 of the association. The default multiplicity is one. An asterisk (*) indicates a multiplicity 
of "many", i.e. zero or more. Each association is optionally labelled with its name, and is 
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also optionally labelled at either end with the role of the corresponding class. An open 
diamond indicates an aggregation association ("is-part-of and is drawn at the 
aggregator end of the association line. A generalization relationship ("is-a") is drawn as a 
solid line joining two classes, with an arrow (in the form of an open triangle) at the 
5 generalization end. When a class diagram is broken up into multiple diagrams, any class 
which is duplicated is shown with a dashed outline in all but the main diagram which 
defines it. It is shown with attributes only where it is defined. 

Netpages are the foundation on which a netpage network is built. They provide 
a paper-based user interface to published information and interactive services, A netpage 

1 0 consists of a printed page (or other surface region) invisibly tagged with references to an 
online description of the page. The online page description is maintained persistently by 
a netpage page server. The page description describes the visible layout and content of 
the page, including text, graphics and images. It also describes the input elements on the 
page, including buttons, hyperlinks, and input fields. A netpage allows markings made 

1 5 with a netpage pen on its surface to be simultaneously captured and processed by the 
netpage system. 

Multiple netpages can share the same page description. However, to allow 
input through otherwise identical pages to be distinguished, each netpage is assigned a 
unique page identifier. This page ID has sufficient precision to distinguish between a 
20 very large number of netpages. 

Each reference to the page description is encoded in a printed tag. The tag 
identifies the unique page on which it appears, and thereby indirectly identifies the page 
description. The tag also identifies its own position on the page. Characteristics of the 
tags are described in more detail below. 

25 Tags are printed in infirared-absorptive ink on any substrate which is infirared- 

reflective, such as ordinary paper. Near-infirared wavelengths are invisible to the human 
eye but are easily sensed by a solid-state image sensor with an appropriate filter. 

A tag is sensed by an area image sensor in the netpage pen, and the tag data is 
transmitted to the netpage system via the nearest netpage printer. The pen is wireless and 
30 communicates with the netpage printer via a short-range radio link. Tags are sufficiently 
small and densely arranged that the pen can reliably image at least one tag even on a 
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single click on the page. It is important that the pen recognize the page ID and position 
on every interaction with the page, since the interaction is stateless. Tags are error- 
correctably encoded to make them partially tolerant to surface damage. 

The netpage page server maintains a unique page instance for each printed 
5 netpage, allowing it to maintain a distinct set of user-supplied values for input fields in 
the page description for each printed netpage. 

The relationship between the page description, the page instance, and the 
printed netpage is shown in Figure 3. The printed netpage may be part of a printed 
netpage document 45. The page instance is associated with both the netpage printer 
1 0 which printed it and, if known, the netpage user who requested it. 

In a preferred form, each tag identifies the region in which it appears, and the 
location of that tag within the region. A tag may also contain flags which relate to the 
region as a whole or to the tag. One or more flag bits may, for example, signal a tag 
sensing device to provide feedback indicative of a function associated with the 
1 5 immediate area of the tag, without the sensing device having to refer to a description of 
the region. A netpage pen may, for example, illuminate an "active area" LED when in 
the zone of a hyperlink. 

In a preferred embodiment, each tag contains an easily recognized invariant 
structure which aids initial detection, and which assists in minimizing the effect of any 
20 warp induced by the surface or by the sensing process. The tags preferably tile the entire 
page, and are sufficiently small and densely arranged that the pen can reliably image at 
least one tag even on a single click on the page. It is important that the pen recognize the 
page ID and position on every interaction with the page, since the interaction is stateless. 

In a preferred embodiment, the region to which a tag refers coincides with an 
25 entire page, and the region ID encoded in the tag is therefore synonymous with the page 
ID of the page on which the tag appears. In other embodiments, the region to which a tag 
refers can be an arbitrary subregion of a page or other surface. For example, it can 
coincide with the zone of an interactive element, in which case the region ID can directly 
identify the interactive element. 

30 Each tag contains typically contains 16 bits of tag ID, at least 90 bits of region 
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JD, and a number of flag bits. Assuming a maximum tag density of 64 per square inch, a 
16-bit tag ID supports a region size of up to 1024 square inches. Larger regions can be 
mapped continuously without increasing the tag ID precision simply by using abutting 
regions and maps. The distinction between a region ID and a tag ED is mostly one of 
5 convenience. For most purposes the concatenation of the two can be considered as a 
globally imique tag ID. Conversely, it may also be convenient to introduce structure into 
the tag ID, for example to define the x and y coordinates of the tag. A 90-bit region ID 
allows 2^^ (--10^^ or a thousand trillion trillion) different regions to be imiquely 
identified. Tags may also contain type information, and a region may be tagged with a 
1 0 mixture of tag types. For example, a region may be tagged with one set of tags encoding 
X coordinates and another set, interleaved with the first, encoding y coordinates. 

In one embodiment, 120 bits of tag data are redxmdantly encoded using a (15, 
5) Reed-Solomon code. This yields 360 encoded bits consisting of 6 codewords of 15 4- 
bit symbols each. The (15, 5) code allows up to 5 symbol errors to be corrected per 

1 5 codeword, i.e. it is tolerant of a symbol error rate of up to 33% per codeword. Each 4-bit 
symbol is represented in a spatially coherent way in the tag, and the symbols of the six 
codewords are interleaved spatially within the tag. This ensures that a burst error (an 
error affecting multiple spatially adjacent bits) damages a minimum number of symbols 
overall and a minimum number of symbols in any one codeword, thus maximising the 

20 likelihood that the burst error can be fully corrected. 

Any suitable error-correcting code code can be used in place of a (15, 5) Reed- 
Solomon code, for example a Reed-Solomon code with more or less redxmdancy, with 
the same or different symbol and codeword sizes; another block code; or a different kind 
of code, such as a convolutional code (see, for example, Stephen B. Wicker, Error 
25 Control Systems for Digital Commimication and Storage, Prentice-Hall 1995, the 
contents of which a herein incorporated by cross-reference). 

rm bodiii K ill Mflln [iliyi l r nl rrprr-rnrnTion or Tnrtn^;:7^o ^ ^rn in F ij Tjre 4a 

and described in our co-pending application US SN_ O^ Z ""Tdocket no. 

NPT002US), includes fixed targetj5fe=ttctTffesT^57l6, 17 and variable data areas 18. The 
30 fixed target structurgs-attow a sensing device such as the netpage pen to detect the tag 
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data bit is represented by a radial wedge in the form of an area bounded bjM:^ radial 
lines and two concentric circular arcs. Each wedge has a mininiup^-dlmension of 8 dots 
at 1600 dpi and is designed so that its base (its inngp-^fc), is at least equal to this 
5 minimum dimension. The height of the we^ge^tithe radial direction is always equal to 
the minimum dimension. Eadi>H5itdata symbol is represented by an array of 2x2 
wedges. The fifteen 4;jMf^ta symbols of each of the six codewords are allocated to the 
four concenta>0^^gymbol rings 18a to 18d in interleaved fashion. Symbols are allocated 
altem^t^ in circular progression around the tag. The interleaving is designed to 

10 ^- rfmximi ^ ^p-t: pafigl Higffln r>^ It^ l uy^H ii Mt i y tivn rjrmhnin nf thf^nmn rn d eword. , 

Li order to support "single-click" interaction with a tagged region via a sensing 
device, the sensing device must be able to see at least one entire tag in its field of view 
no matter where in the region or at what orientation it is positioned. The required 
diameter of the field of view of the sensing device is therefore a function of the size and 
1 5 spacing of the tags. Assuming a circular tag shape, the minimum diameter of the sensor 
field of view 193 is obtained when the tags are tiled on a equilateral triangular grid, as 
shown in Figure 4b. 

The tag image processing and decoding performed by a sensing device such as 
the netpage pen is shown in Figure 6. While a captured image is being acquired fi"om the 
20 image sensor, the dynamic range of the image is determined (at 20). The center of the 
range is then chosen as the binary threshold for the image 21. The image is then 
thresholded and segmented into connected pixel regions (i.e. shapes 23) (at 22). Shapes 
which are too small to represent tag target structures are discarded. The size and centroid 
of each shape is also computed. 

25 Binary shape moments 25 are then computed (at 24) for each shape, and these 

provide the basis for subsequently locating target structures. Central shape moments are 
by their nature invariant of position, and can be easily made invariant of scale, aspect 
ratio and rotation. 

The ring target structure 15 is the first to be located (at 26). A ring has the 
30 advantage of being very well behaved when perspective-distorted. Matching proceeds by 
aspect-normalizing and rotation-normalizing each shape's moments. Once its second- 
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order moments are normalized the ring is easy to recognize even if the perspective 
distortion was significant. The ring's original aspect and rotation 27 together provide a 
useful approximation of the perspective transform. 

The axis target structure 16 is the next to be located (at 28). Matching proceeds 
5 by applying the ring's normalizations to each shape's moments, and rotation-normalizing 
the resulting moments. Once its second-order moments are normalized the axis target is 
easily recognized. Note that one third order moment is required to disambiguate the two 
possible orientations of the axis. The shape is deliberately skewed to one side to make 
this possible. Note also that it is only possible to rotation-normalize the axis target after 
10 it has had the ring's normalizations applied, since the perspective distortion can hide the 
axis target's axis. The axis target's original rotation provides a useful approximation of 
the tag's rotation due to pen yaw 29. 

The four perspective target structures 17 are the last to be located (at 30). Good 
estimates of their positions are computed based on their known spatial relationships to 

1 5 the ring and axis targets, the aspect and rotation of the ring, and the rotation of the axis. 
Matching proceeds by applying the ring's normalizations to each shape's moments. Once 
their second-order moments are normalized the circular perspective targets are easy to 
recognize, and the target closest to each estimated position is taken as a match. The 
original centroids of the four perspective targets are then taken to be the perspective- 

20 distorted comers 3 1 of a square of known size in tag space, and an eight-degree-of- 
freedom perspective transform 33 is inferred (at 32) based on solving the well- 
understood equations relating the four tag-space and image-space point pairs (see 
Heckbert, P., Fundamentals of Texture Mapping and Image Warping, Masters Thesis, 
Dept. of EECS, U. of California at Berkeley, Technical Report No. UCB/CSD 89/516, 

25 June 1989, the contents of which are herein incorporated by cross-reference). 

The inferred tag-space to image-space perspective transform is used to project 
(at 36) each known data bit position in tag space into image space where the real- valued 
position is used to bilinearly interpolate (at 36) the four relevant adjacent pixels in the 
input image. The previously computed image threshold 21 is used to threshold the result 
30 to produce the final bit value 37. 

Once all 360 data bits 37 have been obtained in this way, each of the six 60-bit 
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Reed-Solomon codewords is decoded (at 38) to yield 20 decoded bits 39, or 120 decoded 
bits in total. Note that the codeword symbols are sampled in codeword order, so that 
codewords are implicitly de-interleaved during the sampling process. 

The ring target 1 5 is only sought in a subarea of the image whose relationship 
5 to the image guarantees that the ring, if foxmd, is part of a complete tag. If a complete tag 
is not found and successfully decoded, then no pen position is recorded for the current 
frame. Given adequate processing power and ideally a non-minimal field of view 193, an 
altemative strategy involves seeking another tag in the current image. 

The obtained tag data indicates the identity of the region containing the tag and 
10 the position of the tag within the region. An accurate position 35 of the pen nib in the 
region, as well as the overall orientation 35 of the pen, is then inferred (at 34) from the 
perspective transform 33 observed on the tag and the known spatial relationship between 
the pen's physical axis and the pen's optical axis. 

The tag structure just described is designed to allow both regular tilings of 
15 planar surfaces and irregular tilings of non-planar surfaces. Regular tilings are not, in 
general, possible on non-planar surfaces. In the more usual case of planar surfaces where 
regular tilings of tags are possible, i.e. surfaces such as sheets of paper and the like, more 
efficient tag structures can be used which exploit the regular nature of the tiling. 

An altemative tag structure more suited to a regular tiling is shown in Figure 
20 5a. The tag 4 is square and has foxir perspective targets 17. It is similar in stracture to 
tags described by Bennett et al. in US Patent 5,051,746. The tag represents sixty 4-bit 
Reed-Solomon symbols 47, for a total of 240 bits. The tag represents each one bit as a 
dot 48, and each zero bit by the absence of the corresponding dot. The perspective targets 
are designed to be shared between adjacent tags, as shovra in Figures 5b and 5c. Figure 
25 5b shows a square tiling of 16 tags and the corresponding minimum field of view 193, 
which must span the diagonals of two tags. Figure 5c shows a square tiling of nine tags, 
containing all one bits for illustration purposes. 

Using a (15, 7) Reed-Solomon code, 112 bits of tag data are redundantly 
encoded to produce 240 encoded bits. The four codewords are interleaved spatially 
30 within the tag to maximize resilience to burst errors. Assuming a 16-bit tag LD as before, 
this allows a region ID of up to 92 bits. The data-bearing dots 48 of the tag are designed 
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to not overlap their neighbors, so that groups of tags cannot produce structures which 
resemble targets. This also saves ink. The perspective targets therefore allow detection of 
the tag, so further targets are not required. Tag image processing proceeds as described 
above, with the exception that steps 26 and 28 are omitted. 

5 Although the tag may contain an orientation feature to allow disambiguation of 

the four possible orientations of the tag relative to the sensor, it is also possible to embed 
orientation data in the tag data. For example, the four codewords can be arranged so that 
each tag orientation contains one codeword placed at that orientation, as shown in Figure 
5d, where each symbol is labelled with the number of its codeword (1-4) and the position 

10 of the symbol within the codeword (A-O). Tag decoding then consists of decoding one 
codeword at each orientation. Each codeword can either contain a single bit indicating 
whether it is the first codeword, or two bits indicating which codeword it is. The latter 
approach has the advantage that if, say, the data content of only one codeword is 
required, then at most two codewords need to be decoded to obtain the desired data. This 

1 5 may be the case if the region ID is not expected to change within a stroke and is thus 
only decoded at the start of a stroke. Within a stroke only the codeword containing the 
tag ED is then desired. Furthermore, since the rotation of the sensing device changes 
slowly and predictably within a stroke, only one codeword typically needs to be decoded 
perfi-ame. 

20 It is possible to dispense with perspective targets altogether and instead rely on 

the data representation being self-registering. In this case each bit value (or multi-bit 
value) is typically represented by an explicit glyph, i.e. no bit value is represented by the 
absence of a glyph. This ensures that the data grid is well-populated, and thus allows the 
grid to be reliably identified and its perspective distortion detected and subsequently 

25 corrected during data sampling. To allow tag boundaries to be detected, each tag data 
must contain a marker pattern, and these must be redundantly encoded to allow reliable 
detection. The overhead of such marker pattems is similar to the overhead of explicit 
perspective targets. One such scheme uses dots positioned a various points relative to 
grid vertices to represent different glyphs and hence different multi-bit values (see Anoto 

30 Technology Description, Anoto April 2000). 

Decoding a tag results in a region ID, a tag ID, and a tag-relative pen transform. 
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Before the tag ED and the tag-relative pen location can be translated into an absolute 
location within the tagged region, the location of the tag within the region must be 
known. This is given by a tag map, a function which maps each tag ID in a tagged region 
to a corresponding location. A tag map reflects the scheme used to tile the siirface region 
5 with tags, and this can vary according to sxirface type. When multiple tagged regions 
share the same tiling scheme and the same tag numbering scheme, they can also share 
the same tag map. The tag map for a region must be retrievable via the region ID. Thus, 
given a region ID, a tag ID and a pen transform, the tag map can be retrieved, the tag ID 
can be translated into an absolute tag location within the region, and the tag-relative pen 
10 location can be added to the tag location to yield an absolute pen location within the 
region. 



It may, for example, encoded cartesian coordinates or polar coordinates, depending on 
the surface type on which it appears. The tag ID structure is dictated by and known to the 
1 5 tag map, and tag IDs associated with different tag maps may therefore have different 
structures. 



stmcture described earlier in this section. The preferred coding scheme uses "location- 
indicating" tags as already discussed. An alternative coding scheme uses "object- 
20 indicating" (or "function-indicating") tags. 



tag map associated with the tagged region, yields a xmique tag location within the region. 
The tag-relative location of the pen is added to this tag location to yield the location of 
the pen within the region. This in tum is used to determine the location of the pen 
25 relative to a user interface element in the page description associated with the region. 
Not only is the user interface element itself identified, but a location relative to the user 
interface element is identified. Location-indicating tags therefore trivially support the 
capture of an absolute pen path in the zone of a particular user interface element. 



The tag ID may have a structure which assists translation through the tag map. 



Two distinct surface coding schemes are of interest, both of which use the tag 



A location-indicating tag contains a tag LD which, when translated through the 




icatiiig (Of hmction-mdic^ ting) lag contains — a^ tag ID which 



mterface element in the page description associated with the 
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identt^g-^lie^ ^ser interfa ce element, making them all identical and therefore 
indistinguishable. Object-indicating tags do not, therefore, support the capture>»f^ 
absolute pen path. They do, however, support the capture of a relatiye-peffpath. So long 
as the position sampling frequency exceeds twicelbe--'eficountered tag frequency, the 
5 displacement from one sampled pen^^perStfion to the next within a stroke can be 
unambiguously determingdr-A^^ altemative, the netpage pen 101 can contain a pair or 
motion-sensi3ig-''''Sccelerometers, as described in our co-pending application USSN 

An embodiment of the present invention, in the form of a sensing device for 
1 0 coded surfaces, attachable to a stylus or a marking device such as a pen, pencil or eraser, 
will now be described. The attachable sensing device is hereinafter referred to as a code 
sensor attachment or code sensor, as appropriate. 

The code sensor attachment, when attached to an implement such as a stylus or 
pen, effectively transforms the implement into a netpage stylus or pen. The code sensor 
15 attachment senses the force exerted on the surface by the implement's nib, senses the 
movement of the implement relative to a netpage-encoded surface, and transmits the 
resulting digital ink to a netpage base station. 

A first embodiment of the present invention will now be described with 
reference to Figures 7, 8, 9, 10, 11, 12 and 13. The code sensor attachment, generally 

20 designated by reference numeral 1010, includes a housing 1026, in the form of a plastic 
molding, which defines an interior space for moimting code sensor components. The 
housing 1026 also has projecting grip areas 1018 which allow a user to comfortably grip 
the code sensor. The housing 1026 receives a chassis molding 1032 which in tum 
receives, via opposed slots, a PCB 1034. A cover 1028 is secured to the end of the 

25 housing 1026. The cover 1028 is also of molded plastics, and is formed from semi- 
transparent material to allow the user to view status LEDs 116 mounted at the end of the 
PCB 1034. The cover 1028 also provides an aperture for the optics block 135 mounted 
on the PCB 1034. 

A metal clip 1020 is secured to the housing 1026 at the opposite end to the 
30 cover 1028. The clip is held in place by a pre-stressed flexing metal arm 1024 which 
engages a slot on the underside of the housing 1026. The metal clip 1020 flexibly grips 
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an implement such as a stylus or pen. A rubber pad 1030 is attached to the interior 
surface of the metal clip 1020, providing a highly frictional and somewhat compressive 
seat for the held implement. The metal clip 1020 accommodates a range of implement 
diameters. 

5 Any number of alternative methods may be used to attach the code sensor to 

the implement. For example, an elastic weave strap with Velcro fastening; a pivoting or 
separate clamp plate securable via a nut and bolt assembly; a pair of pivoting and sprung 
clamp plates; or a jubilee clip arrangement securable via a tightening screw. 

When the code sensor attachment is attached to an implement, the user grips 
10 the code sensor by the grip areas 1018, i.e. typically between thumb and forefinger. The 
projecting grip areas 1018 of the housing 1026 define an interior space large enough to 
accommodate the implement without contact. The implement, which is held by the metal 
clip 1020, is allowed to pivot slightly with respect to the housing 1026 when its nib is 
pressed against a surface, by virtue of the flexibility of the flexing metal arm 1024 of the 
15 metal clip 1020. A finger 1022 projecting fi-om the metal clip thereby moves relative to 
the end of the PCB 1034, where an LED 143 illuminates it and a photodiode 144 
captures the intensity of the light it reflects. While the interior of finger's surrounding 
cavity has low reflectivity, the finger is coated to make it highly diffiisely reflective. The 
intensity of light reflected by the finger gives an indication of the displacement of the 
20 finger and thereby gives an indication of the force exerted on the nib. The finger 1022 
has a triangular profile so that the reflected light intensity varies linearly with 
displacement. 

An infirared LED 131, moimted on the PCB 1034, provides infi-ared 
illumination of netpage tags on the surface, while an image sensor 132, also mounted on 

25 the PCB, images the tags. A radio fi"equency (RF) chip 133, containing analog RF 
transceiver components, and a controller chip 1 34 for controlling operation of the code 
sensor, are also mounted on the PCB. An optics block 135, formed fi*om molded clear 
plastics, is mounted on the PCB on top of the infi*ared illumination LED 131 and the 
image sensor 132. It projects illumination firom the LED 131 onto the surface and 

30 projects images of the surface onto the image sensor 132. A replaceable 3V battery 1036 
is seated on the PCB. A flash memory 147, containing controller software, identity and 
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configuration information, and a DRAM 148 of typically 512KB capacity, are also 
mounted on the PCB. 

The code sensor attachment 1010 of this embodiment of the invention is 
specifically arranged to detect coded data recorded on a surface for use in sending 
5 instructions to a computing system. Such coded data includes location tags printed on the 
surface in infrared absorptive ink. Each printed location tag includes encoded region 
identity data which identifies an identity of a region on the surface in which the tag is 
printed (e.g. if the tag is printed on a sheet of paper this data may identify the sheet of 
paper), and location data identifying a location of the tag within the region. The location 
1 0 tags also include target structures which allow their detection and enable calculation of 
the three-dimensional orientation of the pen relative to the surface on which the tag is 
printed (e.g. tilt with respect to the surface). The tag may also contain various control 
data. 

The code sensor attachment 1010 is arranged to determine the position of the 
15 image sensor 132, and thereby the position of the nib, by imaging, in the infrared 
spectrum, an area of the surface in the vicinity of the nib. It records the location data 
from the nearest location tag, and is arranged to calculate the distance of the image 
sensor 132, and thereby the distance of the nib, from an imaged location tag. The 
controller chip 134 calculates the orientation of the pen and the nib-to-tag distance from 
20 the perspective distortion observed on the imaged tag. 

Control data from the location tag may include control bits instructing the code 
sensor to activate its "active area" LED (one of the status LEDs 1 16). Thus, a region on 
the surface which corresponds to the active area of a button or hyperlink may be encoded 
to activate this LED, giving the user of the code sensor visual feedback that the button or 
25 hyperlink is active when the nib passes over it. 

Movements made relative to a surface with an implement which has an 
attached code sensor are typically captured as a series of strokes. A stroke consists of a 
sequence of time-stamped pen positions on the surface, initiated by a pen-down event 
and completed by a subsequent pen-up event. The pen force can be interpreted relative to 
30 a threshold to indicate whether the pen is "up" or "down", or can be interpreted as a 
continuous value, for example when the pen is capturing a signature. The sequence of 
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captured strokes is hereinafter referred to as "digital ink". Digital ink can be used with a 
computing system to form the basis for the digital exchange of drawings and 
handwriting, for on-line recognition of handwriting, and for on-line verification of 
signatures. 

5 Utilizing the RF chip 133 and antenna 112 the code sensor can transmit the 

digital ink data (which is encrypted for security and packaged for efficient transmission) 
to the computing system. 

When the code sensor is in range of a receiver, the digital ink data is typically 
transmitted as it is formed. When the code sensor moves out of range, digital ink data is 
1 0 buffered within the code sensor and can be transmitted later. The DRAM 148, if 512KB, 
can hold digital ink of approximately 12 minutes* duration. 

The various operations of the code sensor attachment 1010 will now be 
described in more detail in the following sections. 

As discussed above, the controller chip 134 is mounted on the PCB 1034. 
15 Figure 14 is a block diagram illustrating in more detail the architecture of the controller 
chip 134. Figure 14 also shows representations of the RF chip 133, the image sensor 132, 
the status LEDs 116, the IR illumination LED 131, the force sensor LED 143, and the 
force sensor photodiode 144. 

The pen controller chip 134 includes a controlling processor 145. Bus 146 
20 enables the exchange of data between components of the controller chip 134. Flash 
memory 147 and a 512 KB DRAM 148 are also included. An analog-to-digital converter 
149 is arranged to convert the analog signal from the force sensor photodiode 144 to a 
digital signal. 

An image sensor interface 152 interfaces with the image sensor 132. A 
25 transceiver controller 153 and baseband circuit 154 are also included to interface with 
the RF chip 133 which includes an RF circuit 155 and RF resonators and inductors 156 
connected to the antenna 112. 

The controlling processor 145 captures and decodes location data from tags 
from the surface via the image sensor 132, monitors the force sensor photodiode 144, 
30 controls the LEDs 116, 131 and 143, and handles short-range radio communication via 
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the radio transceiver 153. It is typically a medium-performance (~40MHz) general- 
purpose RISC processor. 

The processor 145, digital transceiver components (transceiver controller 153 
and baseband circuit 154), image sensor interface 152, flash memory 147 and 512KB 
5 DRAM 148 are integrated in a single controller ASIC 134. Analog RF components (RF 
circuit 155 and RF resonators and inductors 156) are provided in the separate RF chip 
133. 

The image sensor is typically a CCD with an approximate resolution of 
215x215 pixels (such a sensor is produced by Matsushita Electronic Corporation, and is 
1 0 described in a paper by Itakura, K T Nobusada, N Okusenya, R Nagayoshi, and M Ozaki, 
"A 1mm 50k-Pixel IT CCD Image Sensor for Miniature Camera System", IEEE 
Transactions on Electronic Devices, Volt 47, number 1, January 2000, which is 
incorporated herein by reference) with an IR filter. Altematively, it may be a CMOS 
image sensor, and may be integrated with the controller ASIC 134. 

15 The controller ASIC 134 enters a quiescent state after a period of inactivity 

when the code sensor attachment doesn't detect contact with a surface. It incorporates a 
dedicated circuit 150 which monitors the force sensor photodiode 144 and wakes up the 
controller 134 via the power manager 151 on a pen-down event. 

The radio transceiver communicates in the unlicensed 900MHz band normally 
20 used by cordless telephones, or altematively in the unlicensed 2.4GHz industrial, 
scientific and medical (ISM) band, and uses firequency hopping and collision detection to 
provide interference-free communication. 

As discussed above, the optics are implemented by a molded optics block 135. 
The optics implemented by the optics block 135 are illustrated schematically in Figure 
25 15. The optics block comprises a first lens 157 for focusing illumination from the 
infrared LED 131, a mirror 158, a beam splitter 159, an objective lens 160 and a second 
lens 161 for focusing an image onto the image sensor 132. Axial rays 162 illustrate the 
optical path. 

The optical path is designed to deliver a sharp image to the image sensor 132 of 
30 that part of the imaged surface which intersects the field of view cone, within required 
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tilt ranges. The primary focusing element is the objective lens 160. This is also used in 
reverse to project illumination from the IR illumination LED 131 onto the surface within 
the field of view. Since it is impractical to place both the image sensor 132 and the IR 
LED 131 at the focus of the objective, a beam splitter 159 is used to split the path and 
5 separate relay lenses 157 and 161 in each path provides refocusing at the image sensor 
132 and the IR LED 131 respectively. This also allows different apertures to be imposed 
on the two paths. Altematively, separate optical paths may be provided for illumination 
and image capture. 

The edges of the image sensor 132 act as the field stop for the capture field, and 
1 0 the capture path is designed so that the resulting object space angular field of view is as 
required (i.e. just under 20° for the application of this embodiment - see later). The 
illxmiination path is designed to produce the same object space field of view as the 
capture path, so that the illiunination fills the object space field of view with maximum 
power and uniformity. 

15 The IR LED 131 is strobed in synchrony with frame capture. The use of 

focused illumination allows both a short exposure time and a small aperture. The short 
exposure time prevents motion blur, thus allowing position tag data capture during pen 
movement. The small aperture allows sufficient depth of field for the ftiU range of 
surface depths induced by tilt. The capture path includes an explicit aperture stop 191 for 

20 this purpose. 

Because the image sensor 132 typically has a strong response throughout the 
visible and near-infrared part of the spectrum, it is preceded by an infrared filter 163 in 
the capture path so that it captures a clean image of the tag data on the surface, free from 
interference from other graphics on the surface which may be printed using inks which 
25 are transparent in the near-infrared. 

When the nib is in contact with a surface the code sensor determines its 
position and orientation relative to the surface at 100 Hz to allow accurate handwriting 
recognition (see the article by Tappert, C, C Y Suen and T Wakahara, "The State of the 
Art in On-Line Hand Writing Recognition" IEEE Transactions on Pattern Analysis and 
30 Machine Intelligence, Vol 12, number 8, August 1990, the disclosure of which is 
incorporated herein by reference). The force sensor photodiode 144 is utilized to 
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determine, relative to threshold value, whether the pen is "up" or "down". The force may 
also be captured as a continuous value, as discussed above, to allow the full dynamics of 
a signature to be verified, for example. 

The code sensor determines the position and orientation of its nib on the 
5 surface by imaging, in the infrared spectrum, an area of the surface in the vicinity of the 
nib. It decodes the nearest tag data and computes the position of the nib relative to the 
location tag from the observed perspective distortion on the imaged tag and the 
calibrated and known geometry of the implement, the attached code sensor, and the code 
sensor's optical path. Although the position resolution of the tag may be low, the 
10 adjusted position resolution is quite high, and easily exceeds the 200 dpi resolution 
required for accurate handwriting recognition (see above reference). 

As an alternative to locating and decoding tags at the required motion tracking 
rate (e.g. 100 Hz), it is possible to locate and decode tags at a significantly lower rate 
(e.g. 10 Hz), and instead use inter-frame correlation techniques to track motion. In this 

15 approach, pairs of successively captured frames are compared to identify conmion 
pattems and, from their inter-frame displacement, a motion vector. This approach can 
break down if tilt varies rapidly. It is therefore important to model the rate of change of 
tilt and to locate and decode tags at a corresponding rate. Alternatively, rapid tilt change 
can be accommodated within an inter-frame correlation scheme if the scheme actively 

20 tracks features such as tag targets from one frame to the next. The advantage of inter- 
frame correlation techniques is that they are less compute-intensive than full tag 
decoding. They may also be used with lower-resolution or smaller captured images. 

As mentioned above, movements made relative to a surface with an implement 
which has an attached code sensor are typically captured as a series of strokes. A stroke 

25 consists of a sequence of time-stamped pen positions on the surface, initiated by a pen- 
down event and completed by the subsequent pen-up event. A stroke is also tagged with 
the region ID of the surface whenever the region ID changes, i.e. just at the start of the 
stroke under normal circumstances. As discussed above, each location tag includes data 
indicative of its position on the surface and also region data indicative of the region of 

30 the surface within which the tag lies. 

Figure 16 is a diagram illustrating location tag and stroke processing in the 
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code sensor. The pen controller 134 starts with an empty stroke (at 164). It then 
continuously samples (at 165) the nib force via the force sensor photodiode 144, 
checking for a pen-down state (at 166). If the pen is in a pen-down state, the pen 
controller 134 captures (at 167) an image of the surface, locates (at 168) a tag in the 
5 captured image, decodes (at 169) tag data from the tag, infers (at 170) the pen position 
and orientation relative to the surface, and appends (at 171) the position data to the 
current stroke data. On detection of a pen-up event, i.e. on detection (at 172) of a pen-up 
state after a pen-down state as indicated by the presence of a non-empty stroke, the pen 
controller 134 encrypts (at 173) the stroke data and transmits (at 174) the stroke data via 
10 the RF chip 133 and antenna 112, to the computing system. It then starts with another 
empty stroke (at 164). 

Assuming a reasonably fast 8 bit multiply (3 cycles), the processing algorithm 
uses about 80% of the processor's time when the pen is active. 

If the code sensor is out of range of a computing system to transmit to, then it 
1 5 buffers digital ink in its intemal memory. It transmits any buffered digital ink when it is 
next within range of a computing system. When the pen's intemal memory is full the pen 
ceases to capture digital ink and instead flashes its error LED 116 whenever the user 
attempts to write with the implement which has the code sensor attached. 

Figure 17 is a diagram illustrating the structure of the raw digital ink 
20 transmitted from the code sensor to the computing system. Digital ink which is buffered 
in the code sensor when it is working offline is stored in the same form as digital ink 
which is transmitted to the system. 

When the code sensor connects to the computing system, the controller 134 
notifies the system of the pen ID, nib ID, current absolute time, and the last absolute time 

25 it obtained from the system prior to going offline. This allows the system to compute any 
drift in the pen's clock and timeshift any digital ink received from the code sensor 
accordingly. The code sensor then synchronizes its real-time clock with the accurate real- 
time clock of the system. The pen ID allows the computing system to identify the code 
sensor when there is more than one pen being operated with the computing system. Pen 

30 IDs may be important in systems which use the pen to identify an owner of the pen, for 
example, and interact with that owner in a particular directed manner. In other 
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embodiments this may not be required. The nib ID allows the computing system to 
identify which nib is presently being used. The code sensor attachment may include a 
user interface allowing the user to select a nib color and/or style which matches the 
characteristics of the implement to which the code sensor is attached. The computing 
5 system can vary its operation depending upon which nib is being used. For example, if a 
marking nib is being used the computing system may defer producing feedback output 
because immediate feedback is provided by the ink markings made on the surface. When 
a stylus nib is being used, the computing system may produce immediate feedback 
output. 

10 At the start of a stroke the pen controller 134 records the elapsed time since the 

last absolute time notified to the system. For each pen position in the stroke, in the form 
of a raw pen position 177, the controller 134 records the x and y offset of the nib from 
the current tag, the x, y and z rotation of the pen, and the nib force. It only records the tag 
ID if it has changed, in the form of a tag change 178. Since the tag frequency is 

1 5 significantly smaller than the typical position sampling frequency, the tag ID is constant 
for many consecutive pen positions, and may be constant for the entire stroke if the 
stroke is short. 

Since the code sensor samples its position and orientation at 100 Hz, pen 
positions in a stroke are implicitly clocked at 100 Hz and do not need an explicit 
20 timestamp. If the code sensor fails to compute a pen position, e.g. because it fails to 
decode a tag, it must still record a pen position to preserve the implicit clocking. It 
therefore records the position as unknown, in the form of an unknown pen position 179, 
allowing the computing system to later interpolate the position from adjacent samples if 
necessary. 

25 Since the 32-bit time offset of a stroke has a finite range (i.e. 49.7 days), the 

code sensor optionally records an absolute time for a stroke, in the form of a time change 
176. This becomes the absolute time relative to which later strokes' time offsets are 
measured. 

Since the region ED is constant for many consecutive strokes, the pen only 
30 records the region ID when it changes, in the form of a region change 1 80. This becomes 
the region ID implicitly associated with later pen positions. 
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Since a user may change the nib between one stroke and the next, the code 
sensor optionally records a nib ID for a stroke, in the form of a nib change 175. This 
becomes the nib ID implicitly associated with later strokes. 

Each component of a stroke has an entropy-coded prefix. A 10mm stroke of 1 
5 second duration spans two or three tags, contains 100 positions samples, and therefore 
has a size of about 5500 bits. Online continuous digital ink capture therefore requires a 
maximum transmission speed of 5.5 Kbps, and offline continuous digital ink capture 
requires about 40 Kbytes of buffer memory per minute. The code sensor's 512KB 
DRAM 48 can therefore hold over 12 minutes of continuous digital ink. Time, region 
1 0 and nib changes happen so infrequently that they have a negligible effect on the required 
transmission speed and buffer memory. Additional compression of pen positions can 
reduce transmission speed and buffer memory requirements further. 

Each raw stroke is encrypted using the Triple-DES algorithm (see Schneier, B, 
Applied Cryptography, Second Edition, Wiley 1996, the disclosure of which is 
1 5 incorporated herein by reference) before being transmitted to the computing system. The 
pen and computing system exchange session keys for this purpose on a regular basis. 
Based on a conservative estimate of 50 cycles per encrypted bit, the encryption of a one- 
second 5500 bit stroke consumes 0.7% of the processor's 45 time. 

The combined geometry of the code sensor attachment and the implement to 
20 which it is attached is somewhat constrained by the requirements of the optics. An 
alignment tool is therefore provided in the form of a base 1110 which simultaneously 
receives the code sensor attachment 1010 and the implement 1012, allowing the former 
to be attached to the latter in a controlled manner. The base 1 1 10 is illustrated in Figures 
18 to 22. 

25 The base consists of a base molding 1112, a metal pivot shaft 1114, an paired 

levers each consisting of a grip 1116 and an arm 1118. The paired levers wrap around 
the pivot shaft 1114 with the arms 1116 projecting inwards to form a guide which 
marries with the longitudinal slot in the code sensor attachment' s metal clamp 1020. 
The base molding is arranged to receive the code sensor attachment 1010 vertically, 

30 guided by the lever arms 1116. 

Once the code sensor attachment is seated in the base, application of force to 
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the lever grips 1118 forces the lever arms 1116 apart, thus opening the metal clamp 1020 
and readying it for receiving the implement 1012. When the implement is inserted 
vertically through the open clamp 1020, it decends imtil its nib rests on the internal floor 
of the base molding. The base molding holds the code sensor attachment in a slightly 
5 elevated position with respect to the floor, ensuring an ideal distance from the code 
sensor attachment's optics 135 and the implement's nib. 

Releasing the lever grips 1118 allows the clamp 1020 to close, firmly attaching 
the code sensor attachment 1010 to the implement 1012. The implement can then be 
removed from the base, with the code sensor attached, ready to be used. 

10 The base 1110 helps ensure gross calibration of the code sensor optics. Fine 

calibration can then be performed as follows. The geometry is as defined in our co- 
pending application USSN 09/575,154 (docket no. NPTOOIUS). 

The geometry of a pen can be automatically calibrated by determining how the 
sensed viewing distance varies with sensed pen orientation. This can be usefixl when the 
1 5 geometry is not known a priori, such as when the code sensor attachment has just been 
attached to an implement. 

Assuming a point-like rather than curved nib, the geometry is defined by a 
three-dimensional vector from the viewpoint to the nib. Assuming, as before, that the 
optical axis and the nib lie in a known pitch plane, calibration involves sensing the 
20 surface at two different pitch values, determining the lines of intersection between the 
pitch plane and the surface at these pitch angles, and then determining the point of 
intersection of these lines. The point of intersection is the point of the nib. 

Figure 23 shows sensor space with the viewpoint E at the origin and the point 
of the nib A at point (S, D), where D is the viewing distance and S is the distance from 
25 the nib to the center of the field of view window at zero pitch. 

Two lines Li and L2 indicate the surface at different pitch angles 0i and 9 2 
respectively. 

Once the surface has been imaged at a particular pitch angle, the pitch angle 0 
and viewing distance I can be inferred from the imaged tag in the usual way. 

30 Lines Li and L2 are then defined by: 
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2 = -y tan 9i + Ii 



z = —y tan 62 + I2 



Solving for y and z gives the intersection point of the two lines, i.e. the point of 



the nib: 



5 



S = y = (Ii - 12) / (tan Gi - tan 62) 



D = z = (Ii tan 02 - 12 tan Gi) / (tan Gi - tan G2) 

The radius R of a spherical nib can be determined by sensing the surface at 
three pitch angles rather than two, as described below. 



10 different pitch angles. Li and L2 intersect at Pa, while L2 and L3 intersect at Pb. Line La 
starts at Pa and has a direction given by the average of the vectors Ni and N2 normal to 
Li and L2 respectively. Similarly, line Lb starts at Pb and has a direction given by the 
average of the vectors N2 and N3 normal to L2 and L3 respectively. Lines La and Lb 
intersect at K, the center of the spherical nib. The minimum distance from K to any of 

1 5 lines Li, L2 and L3 gives the desired radius R. 

The convex hull of an arbitrary three-dimensional nib shape can be 
approximated by sensing the surface at a number of pitch and roll angles. Each pitch, roll 
and viewing distance triplet uniquely defines a 3D half-space, and the intersection of a 
collection of such half-spaces approximates the convex hull. The larger the number of 
20 orientations at which the surface is sensed, the more closely the intersection 
approximates the convex hull. 

Because the pitch and roll ranges of the pen are limited, the shape of the nib 
can only be determined within these ranges. However, since these ranges also define the 
operating ranges of the pen, they allow the nib shape to be determined to the extent 
25 required to model the interaction of the nib with the surface within the operating ranges 
of the pen. This modeling is what is required to precisely track the motion of the nib, as 
well as to model the shape of the stroke produced by the nib at various orientations. 

The calibration technique does not allow the shape of a concave nib to be 
determined. However, cavities in a nib only affect stroke shape, and not nib motion. 



Figure 24 shows three lines Li, L2 and L3 indicating the surface at three 
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Although the inference of the pen transform as described in the co-pending 
application USSN 09/575,154 (docket no. NPTOOIUS) only takes into account a 
spherical nib, it can be extended trivially to take into account an arbitrary nib shape 
defined by an exact or approximating polygon. 

5 After attaching the code sensor attachment, the user calibrates the resulting 

geometry in conjimction with a special netpage which informs the system that calibration 
is taking place. 

We turn now to altemative embodiments of the coded data and the code sensor 
attachment. 

10 hi a first altemative embodiment, the coded data is indicative of an identity 

which is data identifying the substrate. This enables the code sensor to identify the 
substrate, such as whether the substrate is a particular type of document or whether the 
substrate forms part of another object. Of course, since with this embodiment the tags are 
indicative of the characteristics of the surface and not the location of the tags relative to 

1 5 the surface, separate means are required for sensing movement of the apparatus relative 
to the surface. 

In a second altemative embodiment, the coded data is indicative of an identity 
which is data indicative of the tag type. The code sensor can then identify whether the 
tag represents an object of interest rather than a position on the surface. For example, if 
20 the tag represents an object and corresponds to a user interface input element (e.g. a 
command button), then the tag can directly identify the input element. 

A suitable separate movement sensing means for use with the altemative 
embodiments includes a pair of orthogonal accelerometers 190 mounted in a plane 
normal to the code sensor axis. The accelerometers 190 are shown in Figure 14 in ghost 
25 outline. 

The provision of the accelerometers enables this embodiment of the code 
sensor to sense motion without reference to surface tags, allowing the tags to be sampled 
at a lower rate. 

The acceleration measured by the accelerometers in each of the X and Y 
30 directions is integrated with respect to time to produce an instantaneous velocity and 
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position. 

Since the starting position of the stroke is not known, only relative positions 
within a stroke are calculated. Although position integration accumulates errors in the 
sensed acceleration, accelerometers typically have high resolution, and the time duration 
5 of a stroke, over which errors accumulate is short. 

Listead of providing accelerometers to sense motion relative to the surface, 
altemative motion sensing means may be provided. Such means include motion sensing 
means which includes an optical sensor which cooperates with the surface to generate 
signals indicative of movement of the optical sensor relative to the surface, motion 
1 0 sensing means which includes at least two contacts arranged to contact the surface and 
sense movement in two orthogonal directions, or any other suitable motion sensing 
means for sensing movement relative to a surface. 

The present invention has been described with reference to a preferred 
embodiment and number of specific altemative embodiments. However, it will be 

1 5 appreciated by those skilled in the relevant fields that a number of other embodiments, 
differing firom those specifically described, will also fall within the spirit and scope of 
the present invention. Accordingly, it will be imderstood that the invention is not 
intended to be limited to the specific embodiments described in the present specification, 
including documents incorporated by cross-reference as appropriate. The scope of the 

20 invention is only limited by the attached claims. 
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